Google AI Tool Creates Music from Written Descriptions
2023-02-02
TXT
大字
小字
滚动
全页
1This week, Google researchers published a paper describing results from an artificial intelligence (AI) tool built to create music. 2The tool, called MusicLM, is not the first AI music tool to launch. But the examples Google provides demonstrate musical creative ability based on a limited set of descriptive words. 3AI shows how complex computer systems have been trained to behave in human-like ways. 4Tools like ChatGPT can quickly produce, or generate, written documents that compare well with the work by humans. ChatGPT and similar systems require powerful computers to operate complex machine-learning models. The San Francisco-based company OpenAI launched ChatGPT late last year. 5Developers train such systems on huge amounts of data to learn methods for recreating different forms of content. For example, computer-generated content could include written material, design elements, art or music. 6ChatGPT has recently received a lot of attention for its ability to generate complex writings and other content from just a simple description in natural language. 7Google's MusicLM 8Google engineers explain the MusicLM system this way: 9First, a user comes up with a word or words that describe the kind of music they want the tool to create. 10For example, a user could enter this short phrase into the system: "a continuous calming violin backed by a soft guitar sound." The descriptions entered can include different music styles, instruments or other existing sounds. 11Several different music examples produced by MusicLM were published online. Some of the generated music came from just one- or two-word descriptions, such as "jazz," "rock" or "techno." The system created other examples from more detailed descriptions containing whole sentences. 12In one example, Google researchers include these instructions to MusicLM: "The main soundtrack of an arcade game. It is fast-paced and upbeat, with a catchy electric guitar riff. The music is repetitive and easy to remember, but with unexpected sounds..." 13In the resulting recording, the music seems to keep very close to the description. The team said that the more detailed the description is, the better the system can attempt to produce it. 14The MusicLM model operates similarly to the machine-learning systems used by ChatGPT. Such tools can produce human-like results because they are trained on huge amounts of data. Many different materials are fed into the systems to permit them to learn complex skills to create realistic works. 15In addition to generating new music from written descriptions, the team said the system can also create examples based on a person's own singing, humming, whistling or playing an instrument. 16The researchers said the tool "produces high-quality music...over several minutes, while being faithful to the text conditioning signal." 17At this time, the Google team has not released the MusicLM models for public use. This differs from ChatGPT, which was made available online for users to experiment with in November. 18However, Google announced it was releasing a "high-quality dataset" of more than 5,500 music-writing pairs prepared by professional musicians called MusicCaps. The researchers took that step to assist in the development of other AI music generators. 19The MusicLM researchers said they believe they have designed a new tool to help anyone quickly and easily create high-quality music selections. However, the team said it also recognizes some risks linked to the machine learning process. 20One of the biggest issues the researchers identified was "biases present in the training data." A bias might be including too much of one side and not enough of the other. The researchers said this raises a question "about appropriateness for music generation for cultures underrepresented in the training data." 21The team said it plans to continue to study any system results that could be considered cultural appropriation. The goal would be to limit biases through more development and testing. 22In addition, the researchers said they plan to keep improving the system to include lyrics generation, text conditioning and better voice and music quality. 23I'm Bryan Lynn. 24Bryan Lynn wrote this story for VOA Learning English, based on reports from Google. 25____________________________________________________________ 26Words in This Story 27artificial intelligence - n. the development of computer systems that have the ability to perform work that normally requires human intelligence 28style -n. a particular form or design, usually used in comparing forms of art or handiwork 29instruction -n. a description of how to do something 30arcade - n. an area containing many electronic and other coin-operated games 31upbeat - adj. full of hope and happiness 32repetitive - adj. saying or doing something over and over again 33hum - v. to make a musical sound without opening your mouth 34whistle - v. to make a high sound by forcing air through a small hole in the mouth 35faithful - adj. staying firm about an idea or belief 36appropriate - adj. the level to which something is right for a situation 37cultural appropriation - n. when members of a culture in a society, often the main culture, use a practice of another, often minority, culture, without fully understanding the meaning or importance of the practice. 38______________________________________________________________ 39What do you think of this story? We want to hear from you. We have a new comment system. Here is how it works: 40Each time you return to comment on the Learning English site, you can use your account and see your comments and replies to them. Our comment policy is here.